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The NIST/NASA/CARB Biological Macro- 
molecule Crystallization Database (BMCD), 
NIST Standard Reference Database 21, 
contains crystal data and crystallization con- 
ditions for biological macromolecules. The 
database entries include data abstracted 
from published crystallographic reports. 
Each entry consists of information describ- 
ing the biological macromolecule crystal- 
lized and crystal data and the crystallization 
conditions for each crystal form. The 
BMCD serves as the NASA Protein Crystal 
Growth Archive in that it contains protocols 
and results of crystallization experiments 
undertaken in microgravity (space). These 
database entries report the results, whether 
successful or not, from NASA-sponsored 
protein crystal growth experiments in mi- 



crogravity and from microgravity crystal- 
lization studies sponsored by other interna- 
tional organizations. The BMCD was 
designed as a tool to assist x-ray crystallog- 
raphers in the development of protocols to 
crystallize biological macromolecules, those 
that have previously been crystallized, and 
those that have not been crystallized. 
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1. Introduction 

The three-dimensional structures of biological macro- 
molecules have become essential elements for guiding 
studies and developing an understanding of biological 
processes. The coordinates provide essential informa- 
tion for protein engineering, rational drug design, and 
other biotechnology efforts. X-ray crystallographic stud- 
ies of biological macromolecules provide a mechanism 
of attaining this information. The diffraction studies re- 
quire the production of large single crystals of the bio- 
logical macromolecule of interest. An example of a crys- 
tal of ribonuclease S suitable for crystallographic studies 
is shown in Fig 1. 

The solution properties of a biological macro- 
molecule, determined by factors such as size, shape, 



surface complexity, and conformational stability, di- 
rectly relate to its ability to crystallize. A number of 
recent studies of the biophysical properties of biological 
macromolecules to discover simple methods that would 
have predictive capabilities have been prompted by 
recognition of these facts (i.e., Refs. [1-2]). Tradition- 
ally, empirical procedures consisting of a series of 
experiments that vary different parameters such as pH, 
temperature, ionic strength, and macromolecule concen- 
tration are carried out (for a recent review see Ref. [3]) 
to avoid the current lack of understanding of the crystal 
growth process needed to predict crystallization behav- 
ior. The number of experiments required for success is 
dependent on the macromolecule and the choices made 
by the investigator. If crystallization occurs over a broad 
range of conditions, or if the right choices are made in 
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the initial experiments, the search ends quickly. In other 
cases many experiments are required to discover crystal- 
lization conditions, and occasionally no crystallization 
conditions are found, regardless of how many experi- 
ments are done. 




Fig. 1. A photomicrograph of a crystal of bovine ribonuclease S 
grown aboard the Space Shuttle Flight STS-65 during the July 8th 
mission. The crystal was grown in a vapor diffusion reactor of the 
Advanced Protein Crystallization Facility at 20 °C. The droplet con- 
tained 25 mg/mL protein, 0.7 M ammonium sulfate, 1.8 M sodium 
chloride, and 0.05 M sodium acetate at pH 5.0. This was equilibrated 
against a reservoir solution composed of 1.4 M ammonium sulfate, 3.6 
M sodium chloride, and 0.1 M sodium acetate, pH 5.0. 



Although several systematic procedures and strategy 
suggestions have been put forth [4-11], no universal 
strategy for searching for the crystal growth parameters 
for a biological macromolecule has gained widespread 
acceptance. Many laboratories have recently begun 
using experimental procedures called "fast screens" 
[12] that use sparse matrix sampling techniques [6]. 
These procedures employ a set of experiments that scan 
a wide range of pH and a variety of precipitants, buffers, 
and additives that have proven successful in other crystal 
growth studies. These strategies are based on the 
successes of many scientists, that have produced suitable 
crystals for diffraction studies for a variety of macro- 
molecules. 

The development of a productive crystallization strat- 
egy may yield crystals, but the internal order of a crystal 
dictates the quality of x-ray diffraction limiting the in- 
formation that can be obtained from the ensuing crystal- 
lographic studies. An improvement in the diffraction 
quality of the crystals will not only facilitate the struc- 
tural investigation, but also directly influence the 



accuracy of the coordinates obtained from the structural 
investigation. Recently, a number of crystallization ex- 
periments undertaken in space indicate microgravity has 
a positive influence on the crystal growth process, i.e., 
an improvement in the diffraction quality of crystals of 
biological macromolecules [13]. Because of these find- 
ings, several laboratories throughout the world are 
investigating the role of gravity in the crystallization 
process. Many additional crystallization experiments 
will be required to discover the reasons for the observed 
differences in the diffraction resolution observed 
between earth and space grown crystals. 

The Biological Macromolecule Crystallization Data- 
base (BMCD), besides cataloging and summarizing the 
information concerning crystallization that is available 
in the literature, now includes the National Aeronautics 
and Space Administration (NASA) Protein Crystal 
Growth Archive. A description of the data and software 
that constitute the BMCD and the NASA Protein Crystal 
Growth Archive along with illustrations of how the 
BMCD can be used to address the needs of macro- 
molecular crystallographers in the search for crystalliza- 
tion protocols is presented here. 

2. The Biological Macromolecule Crystal- 
lization Database (BMCD) 

In 1989, with assistance from the National Institute 
of Standards and Technology Standard Reference Data 
Program, the NIST/CARB (Center for Advanced Re- 
search in Biotechnology) BMCD database was created. 
The menu driven software used to access the data is 
written and compiled with Clipper'-^ and runs as an 
independent program on personal computers (PCs). All 
database files are in dBase III Plus^ format, and all of 
the index files are in the Clipper indexing format. The 
initial data consisted of the crystallization conditions of 
1,025 crystal forms of 616 biological macromolecules. 
This information, which includes most of the crystal- 
lization protocols of biological macromolecules in the 
literature through the end of 1982, had been deposited 
earlier in the Brookhaven Protein Data Bank [14]''. 



' Certain commercial equipment, instruments, or materials are identi- 
fied in this paper to foster understanding. Such identification does not 
imply recommendation or endorsement by the National Institute of 
Standards and Technology, nor does it imply that the materials or 
equipment identified are necessarily the best available for the purpose. 
^ Clipper is a registered trademark of Nantucket Corp., 12555 West 
Jefferson Boulevard, Los Angeles, CA 90066. 

' dBASE HI Plus is a registered trademark of Borland International 
Inc. 
* GiUiland, unpublished data (1983). 
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A second version of the NIST/CARB BMCD soft- 
ware and data was released in 1991 [10]. This release 
included data for 1,465 crystal forms of 924 biological 
macromolecules. In 1994 the BMCD became the NASA 
Protein Crystal Growth (PCG) Archive and began in- 
cluding data from crystal growth studies supported by 
NASA [15]. The software was expanded and released as 
the NIST/NASA/CARB BMCD Version 3.0 including 
data for 2,218 crystal forms of 1,465 biological macro- 
molecules through 1994. 

Recently, Version 3.0 of the BMCD has been ported 
to a UNIX platform to take advantage of the develop- 
ment of network capabilities that employ client-server 
tools [16]. This implementation of the BMCD uses the 
POSTGRES database management system [17] and 
World Wide Web/NCSA Mosaic client-server protocols 
[18]. The network version provides most of the features 
of the earlier PC versions of the BMCD. It also gives the 
user community access to the most recent updates, and 
it allows the rapid incorporation of new features and 
capabilities of the software. 

3. BMCD Data 

The BMCD includes crystallization data for peptides, 
proteins, protein-protein complexes, nucleic acids, 
nucleic acid-nucleic acid complexes, protein-nucleic 
acid complexes and viruses for which diffraction quality 
crystals have been obtained and reported in the litera- 
ture. In addition, the BMCD, serving as the NASA 
Protein Crystal Growth Archive, contains the crystal- 
lization data generated from ground-based and micro- 
gravity crystallization studies supported by NASA. 
Crystallization data from microgravity experiments 
sponsored by other international space agencies are also 
included. 

The information contained in the BMCD is divided 
into three major categories, biological macromolecule, 
crystallization, and summary data. Each biological 
macromolecule included in the database has a unique 
biopolymer sequence. For example, a mutant protein 
with a single amino acid change from the wild type 
protein is considered a separate macromolecule entry. 
Analogously, a crystal reported in the literature must 
have undergone a preliminary x-ray crystallographic 
study and have unique unit cell constants. Crystal entries 
that are nearly isomorphous with previously reported 
crystals are included if the author(s) describes signifi- 
cant differences in the intensity distribution of the 
diffraction pattern from that previously reported. The 
BMCD data elements for each category mentioned 
above are described below. Many data items, which are 
not appropriate for the biological macromolecule or 



crystal form or which may be missing from the 
published report, are not included in the data entries. 

3.1 Biological Macromolecule Entry 

The entries for each biological macromolecule are 
composed of data elements that provide a unique de- 
scription of the biopolymer or biopolymer complex. The 
data elements include the preferred name, the systematic 
name, and other aliases of the macromolecule. Each 
entry contains biological source information that con- 
sists of the common name, genus-species, tissue, cell, 
and organelle from which the macromolecule was iso- 
lated. Attempts have also been made to include this 
information for recombinant proteins expressed in a 
foreign host. The subunit composition and molecular 
weight are present. This information consists of the total 
number of subunits and the total molecular weight, and 
the number and corresponding molecular weights of 
each type of distinct subunit associated with the macro- 
molecule. A subunit of a biological macromolecule is 
defined as a part of the in vivo assembly associated with 
other parts by noncovalent interactions. For example, 
the two monomers of a dimeric enzyme and the two 
oligomeric nucleic acid strands of a double-stranded 
nucleic acid fragment are considered in both cases as 
two subunits. The name of a prosthetic group associated 
with a biological macromolecule is included if it is 
reported in the crystallographic studies. If the macro- 
molecule is an enzyme, the EC number [19] and cata- 
lytic reaction are provided. Finally, general remarks are 
included for cases where special features of the macro- 
molecule are important, or might influence, its crystal- 
lization behavior. A four character alphanumeric identi- 
fier beginning with the letter M is assigned to each 
biological macromolecule entry. A macromolecule entry 
for an enzyme, bovine ribonuclease A, is illustrated in 
Fig. 2. 

3.2 Crystal Entry 

Each crystal form of a particular biological macro- 
molecule characterized by diffraction studies is de- 
scribed as a crystal entry. The data in each crystal entry 
includes the crystallization procedure, the crystal data, 
crystal morphology, and complete references. All of the 
details required to reproduce the crystallization proce- 
dure are data elements of the crystal entry. These ele- 
ments consist of the crystallization method, the macro- 
molecule concentration, the temperature, the pH, the 
chemical additives to the growth medium (buffer, 
precipitant and/or stabilizers), and the length of time 
required to produce crystals of a size suitable for diffrac- 
tion experiments. If the crystallization deviates from 
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Macromolcculc Entry: M03C 
Molecule Name: ribonuclease A 
Alias(cs): ribonuclease (pancreatic) 

hydrolase, pyrimidinooligonucleotido-. ribonucleate 2',3'- 
Source Common Name: bovine 

Tissue: pancreas 
EC Number: 3.1.27.5 

Catalytic Reaction: endonucleolytic cleavage to 3'-phosphomononucleotides and 
3'-phosphooligonuclcotides ending in C < p > or U <p > with 2',3'-cyclic phosphate intermediates 
Total Molecular Weight: 13683 
Total No. Subunits: 1 

Remarks: 2':3'-cyclic phosphates are formed as intermediates; formerly EC 2.7.7.16 and 3.1.4.22 
(124 aa). 
Number of Crystal Forms: 17 

Fig. 2. A representative example of the biological macromolecule entry M03C for ribonuclease A in 
the BMCD. 



Standard protocols [7], an outline of the procedure is 
provided in the comments. The crystal size and shape 
are given along with the resolution limit of x-ray diffrac- 
tion as described in the reference. References are pro- 
vided for published crystal photographs or diffraction 
pictures. The crystal data include the unit cell dimen- 
sions (a, b, c, a, 13, y), the space group, the number of 
molecules in the unit cell (Z), and the crystal density. 
Data entries are also cross referenced to other structural 
biology databases such as the Brookhaven Protein Data 
Bank [14]. Each crystal entry is also given a four charac- 
ter alphanumeric identifier beginning with C. A crystal 
entry for the macromolecule entry illustrated in Fig. 2 is 
shown in Fig. 3. 

3.3 Summary Information 

The summary information gives the user a convenient 
mechanism for browsing the BMCD data. The user has 
access to a complete list of macromolecule names, tabu- 
lations of the number of macromolecules and crystal 
forms for each source, prosthetic group, space group, 
chemical addition, crystallization method, and a listing 
of complete references. The panels in Figs. 4a and 4b 
illustrate part of the summary information for the 



chemical addition and crystallization method tabula- 
tions. 

Facilities are provided for comprehensive searches of 
the reference information. References can be queried for 
matches with a particular author, a key word or phrase. 
The BMCD also provides a listing of general references 
concerning all aspects of crystal growth. These refer- 
ences have been divided into categories that include 
reviews and books, articles concerning procedures, and 
references concerning nomenclature. Most references 
include complete titles, and remarks are often added to 
emphasize important aspects of a reference that may not 
be evident from the title. 

4. The NASA PCG Archive 

As briefly mentioned above, a new function of the 
BMCD is to serve as the NASA PCG Archive. A com- 
plete description of the biological macromolecule 
crystallization experiments performed in microgravity 
in the NASA Space Shuttle and in ground control exper- 
iments is included in the BMCD. In addition, the 
BMCD includes data and results from microgravity 
crystallization experiments sponsored by other interna- 
tional space agencies. The data are organized in a 
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Co'stal entr>: C05L 



a b c alpha beta gamma Z Space Group CS 

30.28 38.39 53.16 90.00 105.83 90.00 4 P2<1> Monoclinic 

Method(s) used for the crystallization: batch method 

Macromolecule concentration: 25.000 mg/mL 

Temperature: 25.0 degrees C 

pH: 5.000 

Chemical additions to the crystal growth medium: 
tertiary butanol 50.0000 % 

Diffraction limit: 1.500 Angstroms 

Reference's designation of this crystal form: II 

Cross reference to Protein Data Bank: 5RSA 3RN3 

Comments: Crystals arc usually grown with seeding. ROZO - reports the structure of cytidylic acid 

(2'-CMP) bound to ribonuclease A at 2.3 A resolution. 

References: 



R04Y: 
R142: 

R0Z6: 



R141: 

R04W: 
ROZO: 



Bernal, JD; Carhsle, CH; Rosemeyer, MA (1959) Acta Crystallogr, 12,227-231. 

Borkakoti, N (1983) Eur J Biochem, 132(1), 89-94. "The active site of ribonuclease 

A from the crystallographic studies of ribonuclease- A-inhibitor complexes." 

Borkakoti, N; Palmer, RA; llaneef I; Moss, DS (1983) J Mol Biol, 169(3), 743-755. 

"Specificity of pancreatic ribonuclease-A. AnX-ray study of aprotein-nucleotide 

complex." 

Carlisle, CH; Palmer, RA; Mazumdar, SK; Gorinsky, BA; Yeates, DGR (1974) J 

Mol Biol, 85, 1 -18. "The structure of ribonuclease at 2.5 Angstrom resolution." 

Fankuchen, I (1945) Advan Protein Chem, 2, 387-405. 

Howlin, B; Harris, GW; Moss, DS; Palmer, RA (1987) J Mol Biol, 196(1), 159-164. 

"X-ray refinement study on the binding of cytidylic acid (2'-CMP) to ribonuclease 

A." 



Fig. 3. The BMCD cyrstal entry C05L, a representative example of one of the 17 crystal forms on 
ribonuclease A macromolecule entry M03C in the BMCD. It should be noted that the concentrations of 
chemical additives use four decimal places to accommodate the range of values of concentrations of the 
database, not to indicate the precision of the data. 
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Fig. 4a. A display of the BMCD summary information for chemical 
additions to the crystallization solutions. 



Fig. 4b. A display of the BMCD summary information for crystal- 
lization methods. 
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similar way as other entries in the BMCD. The data 
comprising the PCG Archive mentioned above contains 
biological macromolecule and crystal entries as de- 
scribed above with additional data items that relate to 
the microgravity experiment(s). In fact, in the PCG 
Archive a single crystal form of a macromolecule may 
have multiple entries, one for each microgravity experi- 
ment. Each of these crystallization entries is also given 
a four character alphanumeric identifier beginning with 
the letter C. An example of a microgravity crystal entry 
for a ribonuclease A (entry M03C, Fig. 2) experiment is 
shown in Fig. 5. 



As part of the NASA archive, new summary displays 
of information specific to the microgravity crystalliza- 
tion experiments have been added to the BMCD. 
Included are lists of the apparatuses used in micrograv- 
ity experiments, of the microgravity missions, of the 
acronyms and names of sponsors, and of references 
discussing theory or experimental results of crystalliza- 
tion of biological macromolecules in microgravity. 
Detailed descriptions of the crystallization apparatuses 
that are in use for microgravity crystallization experi- 
ments and that have grown crystals are also available. 



Microgravity data for crystal: CICQ 

Flown on MASER3 10/04/89 to 10/04/89 

Time at microgravity: 7.25 minutes 
Equipment used on fliglit: Protein Crystal Growth module 
Number of experiments flown with these crystallization conditions: 1 

Investigators: Sjolin, L(leader); Wlodawer, A; Bergqvist, G; Holm, P; Loth, K; Malmstrom, H; 
Zaar, J; Svensson, LA; Gilliland, GL 

Comments on microgravity experiments: More and larger crystals grew at microgravity than 
during ground based experiments. For such short flights seed crystals were found to be required; 
very few crystals grew in those chambers where seeds were not added. 



Crystal entry CE#: C 1 CQ Microgravity Data 

a b c alpha beta gamma Z Space Group CS 

30.18 38.40 53.32 90.00 105.85 90.00 4 P2<1> Monoclinic 
Method(s) used for the crystallization: macroseeding, see comments 
Macromolecule concentration: 70.000 mg/mL 
Temperature: 18.0 to 20.0 degrees C 
pH: 5.300 
Chemical additions to the crystal growth medium: 



2-methyl-2-propanol 
sodium hydroxide 



30.0000 
0.0000 



% 



Diffraction limit: 1.000 Angstroms 

Crystal dimensions in mm: 1.0 0.4 0.5 

Reference for the photograph or drawing: RlIX 

Reference's designation of this crystal form: microgravity 

Comments: 3.5 mL of the protein solution containing 30% 2-methyl-2-propanol was placed in the 

crystallization cylinder. 1 .5 mL of 60% 2-methyl-2-propanol solution was placed in the plunger 

together with one or more seed crystals which had been washed several times in 60% alcohol 

solution. 

References: 

RlIX: Sjolin, L; Wlodawer, A; Bergqvist, G; Holm, P; Loth, K; Malmstrom, H; Zaar, J; Svensson, 
LA; Dill, J; Gilliland, GL (1992) European Space Agency SP-1132, 2, 92-103. "Protein 
Crystals Grown On Board MASER 3 Extended the Ribonuclease A Structure to 1 .06 A 
Resolution." 

RlIZ: Sjolin, L; Wlodawer, A; Bergqvist, G; Holm, P; Loth, K; Malmstrom, H; Zaar, J; Svensson, 
LA; Dill, J; Gilliland, GL (1991) "Structure, Mechanism and Function of Ribonucleases", 
IBF Publications, Universitat Autonoma de Barcelona, Bellaterra, Spain, CM Cuchillo, R 
de Llorens, MV Nogues, X Pares (eds), 39-50. "Protein crystals grown aboard MASER 3 
extend the ribonuclease A structure to 1 .06 A resolution." 

RIIY: Svensson, LA; Sjolin, L; Dill, J; Gilliland, GL (1991) "Structure, Mechanism and Function 
of Ribonucleases", IBF Pubhcations, Universitat Autonoma de Barcelona, Bellaterra, Spain, 
CM Cuchillo, R de Llorens, MV Nogues, X Pares (eds), 31-38. "The conformational 
flexibility of surface residues of bovine ribonuclease A at 1 .1 A resolution." 

Fig. 5. A representative example for a microgravity crystallization entry (CICQ) of ribonuclease A 
(M03C) in the BMCD. 
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5. Querying the BMCD 

Both the PC and WWW versions of the BMCD 
provide user interfaces for querying the database. Data- 
base searches may be carried out for data elements of 
any of the categories listed in Table 1 . Depending upon 
the element selected, a parameter value (numerical or 
text) or a range of values (numerical) is requested. The 
interfaces provide mechanisms that allow the use of the 
Boolean logic AND and OR functions to generate quite 
complex searches. The results of searches can be dis- 
played in a variety of ways and can be written to an 
ASCII file or printed. 

6. Crystallization Strategies 

From its inception the BMCD has been designed as a 
tool to assist in the development of crystallization strate- 
gies for biological macromolecules. These strategies 
range from reproducing published procedures to crystal- 
lizing a biological macromolecule that is unrelated to 
any that has previously been crystallized [10]. Discussed 
below are four general categories of problems that 
present themselves. These include developing strategies 
for the crystallization (1) of a previously crystallized 
biological macromolecule, (2) of a modified or mutant 
biological macromolecule for which the unmodified or 
wild-type biological macromolecule has been reported, 
(3) of a biological macromolecule that is homologous to 
previously crystallized macromolecule(s), and (4) of a 
biological macromolecule that was not previously crys- 
tallized. 

6.1 Previously Crystallized Macromolecules 

The BMCD contains the information to reproduce the 
crystallization conditions for biological macromolecules 



reported in the literature. The reported crystallization 
conditions are the starting points to initiate the crystal- 
lization trials. The crystallization of the biological 
macromolecule may be quite routine, but differences in 
the isolation and purification procedures, reagents, and 
crystallization methodology of laboratories can influ- 
ence dramatically the reproducibility, and therefore, the 
successful outcome. The crystallization conditions in 
the database should be considered as an initial starting 
point that may require experiments that vary pH, macro- 
molecule and reagent concentrations, temperature, along 
with the crystallization method. The protocol used to 
produce crystals for the structure determination of 
bovine chymosin [20] represent the adaptation of a pro- 
cedure published many years earlier [21]. The modified 
procedure used the hanging drop vapor diffusion proce- 
dure [22] that was not being practiced when the first 
crystal growth studies were reported. 

6.2 Variant Macromolecule or Ligand-Biological 
Macromolecule Complexes 

The structures of sequence variants, chemically mod- 
ified macromolecules, or ligand-biological macromole- 
cule complexes are becoming routine structure determi- 
nation problems for crystallographic laboratories. 
Frequently the altered biopolymer sequence, chemical 
modification, and presence of the ligand do not interfere 
with crystal packing interactions, significantly alter the 
conformation, or influence the solution properties of the 
macromolecule. If this is the case, the crystallization 
conditions of the biological macromolecule may be sim- 
ilar to the wild type, unmodified or the "free" biological 
macromolecule. This may be true for even quite radi- 
cally modified variants such as the nine residue, high- 
affinity calcium site deletion mutant of subtilisin BPN' 
[23]. This variant crystallized under virtually identical 
conditions with that of the wild type molecule. 



Table 1. The BMCD search parameters 



Macromolecule entry 






Macromolecule name 


Biological source 


Molecular weight 


Subunit composition 


Prosthetic group 


Multiple crystal forms 


Crystal entry 






Crystal data 


Crystallization conditions 


Reference 


Space group 


Crystallization method 


Author 


Unit cell dimensions 


Macromolecule concentration 


Year 


Z, molecules/unit cell 


Crystallization Temperature 


Journal 


Crystal density 


pH of crystallization 
Crystal growth time 
Chemical additions 


Database ID 
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6.3 Homologous Biological Macromolecules 

Frequently, crystallographic studies are initiated with 
a biological macromolecule related through sequence 
(and presumably structural) homology to a set of macro- 
molecules that have previously been crystallized. The 
BMCD can be used to tabulate the crystallization condi- 
tions for the members of the family, and then this infor- 
mation can be used to limit the parameters for the initial 
crystallization trials with the biological macromolecule. 
For example an examination of crystallization condi- 
tions for the single-chain acid proteinases listed in 
Table 2 indicate that most crystals are grown using a 
variety of the precipitants: ammonium sulfate, lithium 
sulfate, ethanol, polyethylene glycol 4000, and poly- 
ethylene glycol 8000. One half of the crystal forms listed 
in Table 2 utilize ammonium sulfate suggesting that this 
would be the reagent to use in the initial trials. The 
protein concentration ranges from 2 mg/mL to 50 mg/ 
mL, the pH ranges from 1.5 to 6.3, and the temperature 
ranges from 4 °C to 26 °C. These reagents and parame- 
ters would then represent a logical starting point for 
initiating a limited set of crystallization trials. If no 
crystals were obtained using this information, the more 
general procedure outlined next would be warranted. 

6.4 General Crystallization Procedure 

The use of the BMCD has been incorporated into 
general procedures required for the crystallization of a 
unique biological macromolecule that has never been 
crystallized [9, 10]. A general procedure for a soluble 
recombinant protein is illustrated in Fig. 6. Once a 
protein is cloned, expressed, and purified, it is concen- 
trated (if possible) to at least 10 mg/mL, and preferably 
to a concentration of at least 25 mg/mL or higher. It is 
then dialyzed into 0.005 to 0.025 M buffer at neutral pH 
or at a pH required to maintain solubility of the protein. 
Other stabilizing agents such as EDTA and dithiothreitol 
may be included at low concentrations to stabilize the 
biological macromolecule during the crystallization tri- 
als. Once this has been accomplished, two different 
types of experimental procedures are carried out con- 
currently, one employing the vapor diffusion method and 
the other microdialysis. 

Vapor diffusion experiments are set up after selecting 
from 3 to 10 of the reagents that have been most suc- 
cessful at inducing crystallization (Fig. 4a) for soluble 
proteins. The data in the BMCD is also used to set the 
limits for parameters such as pH and temperature. A 
distribution of the number of crystal forms for all biolog- 
ical macromolecules versus pH and temperature is 
shown in Figs. 7a and 7b, respectively. For the procedure 



illustrated in Fig. 6, three different pH's are selected, 
4.0, 6.0 and 8.0 and experiments are done at both room 
and coldroom temperatures, approximately 20 °C and 
6 °C, respectively. Before actually setting up the vapor 
diffusion experiments, small aliquots of concentrated 
buffered reagents (0.1 M buffer) are added to a small 
droplet (10 (xL) of the protein in a stepwise manner to 
determine the protein's precipitation point [4]. After 
each addition of 0.5 (xL to 1.0 (xL of the reagent, the 
droplet is sealed in an airtight chamber and observed 
after 10 min. If precipitation occurs, distilled water can 
be added to the droplet to attempt to redissolve the 
protein. These titration experiments set the limits of 
reagent concentrations for the vapor diffusion experi- 
ments. These should be done at both room and coldroom 
temperatures. 

Once the precipitation points have been found, sets 
of vapor diffusion experiments covering the range of 
reagents and pH conditions are initiated at both room 
and coldroom temperatures. Small droplets of protein 
(5 (xL) are mixed with an equal volume of a reservoir 
solution and placed in a sealed chamber that allows the 
droplet to equilibrate with 0.5 mL to 1.0 mL of the 
reservoir solution. At the selected pH, successive exper- 
iments are initiated that increase the precipitant concen- 
tration to a value greater than that observed in the titra- 
tion experiment. For example, if a protein precipitates at 
50 % saturated ammonium sulfate, experiments that 
equilibrate droplets against 44 %, 47 %, 50 %, 53 %, and 
55 % saturated ammonium sulfate. An equivalent set of 
vapor diffusion experiments with protein solution con- 
taining an effector (a ligand, cofactor, metal ion, etc.) 
can be carried out in parallel. 

Besides the precipitant (or crystallizing) reagents 
mentioned above, an analysis of the data contained in the 
BMCD reveals that about 10 % of the soluble proteins 
crystallize at low ionic strength (< 0.2 M). Thus, the 
procedure outlined in Fig. 6 indicates that a set of micro- 
dialysis experiments equilibrating the protein solutions 
against low ionic strength should be carried out for an 
unknown protein. Initially, experiments over a pH range 
of 3.0 to 9.0 in steps of 0.5 to 1.0 should be carried out, 
again, at both room and coldroom temperatures. It is 
also worthwhile to do microdialysis experiments at or 
near the protein's isoelectric point, a point at which a 
protein is often the least soluble. As with the vapor 
diffusion experiments mentioned above, the introduction 
of small quantities of ligands, products, substrate, sub- 
strate analogs, monovalent or divalent cations, organic 
reagents, etc., to the crystallization mixtures may facili- 
tate crystal growth. Thus, it is recommended that a 
second set of experiments be run in parallel to those 
described above. 
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Fig. 6. A general strategy for finding the crystallization conditions for a recombinant protein 
based on the data in the BMCD. 
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Fig. 7a. The distribution of the number of crystal forms in the 
BMCD as a function of pH. 



After initiating the experiments above periodic as- 
sessment of the experiments is done. The time for crys- 
tals to appear ranges from a few hours to many months, 
but usually if there is no success within three weeks, 
further crystallization trials are suggested. For the vapor 
diffusion experiments, additional reagents should be se- 
lected from the BMCD summary and the titration car- 
ried out before setting up a new set of experiments. 
Alternatively, a finer increment in pH and/or other tem- 
peratures should be considered. For the microdialysis 
experiments additional experiments at a finer increment 
should be carried out. For both the vapor diffusion and 
microdialysis experiments other effectors or additives 
should also be considered. 




-10-1 0-4 S-9 10-14 IS -19 20-24 2S -29 SA .34 35-39 40-100 

Temperature, °C 

Fig. 7b. The distribution of the number of crystal forms in the 
BMCD as a function of temperature. 

If crystals are discovered in either the vapor diffusion 
or the microdialysis experiments, the parameters that are 
influencing crystal growth should be optimized. This 
requires small perturbations in the pH, temperature, and 
the reagent and protein concentrations. At this point 
seeding experiments should also be considered to im- 
prove crystal size or reproducibility. Even if crystals are 
large enough for diffraction studies, optimization may 
be necessary to improve the diffraction quality. If the 
crystals diffract well, the three-dimensional structure 
determination can be initiated, but if the crystals do not 
diffract, or diffract poorly, the search should be contin- 
ued as mentioned above. 
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7. Future Developments of the BMCD 

The primary goal of the BMCD will continue to be a 
complete, error-free, and up-to-date crystallization data- 
base and NASA PCG Archive. The availability of the 
BMCD on the Internet will simplify user reporting of 
errors and distribution of new data. The current version 
of the software is being expanded to include alternate 
ways for users to display the data. Because of the BMCD 
being one of several biological and crystallographic 
databases on the internet, direct links to related entries 
in other databases are being considered. Finally, soft- 
ware tools for users engaged in crystal growth studies 
are being developed that will assist the user in deciding 
what experiments are appropriate and in designing the 
experiments for a particular biological macromolecule. 
These tools incorporate the principles outlined in the 
discussion of crystallization strategies outlined above. 
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